Combining Missing Data Imputation and Pattern Classification in a Multi-Layer Perceptron

نویسندگان

  • José-Luis Sancho-Gómez
  • Pedro J. García-Laencina
  • Aníbal R. Figueiras-Vidal
چکیده

Multi-Layer Perceptrons (MLPs) have been successfully applied in many pattern classification tasks. However, a drawback of these learning machines is that they cannot handle input vectors that present missing data on its features. A recommended way for dealing with missing values is imputation, i.e., to fill in missing data with plausible values. This paper presents a brief review of handling missing data, including the new Multi-Task Learning (MTL) systems. Moreover, an MLP approach for incomplete pattern classification based on MTL is proposed. This network learns in parallel the classification task (main task) and the different tasks associated to each incomplete feature (secondary tasks). During training, unknown values are imputed, being this missing data imputation process oriented by the learning of the classification task. Experimental results on five classification problems are given to show the effectiveness of the proposed approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Influence of Pattern of Missing Data on Performance of Imputation Methods: An Example from National Data on Drug Injection in Prisons

Background Policy makers need models to be able to detect groups at high risk of HIV infection. Incomplete records and dirty data are frequently seen in national data sets. Presence of missing data challenges the practice of model development. Several studies suggested that performance of imputation methods is acceptable when missing rate is moderate. One of the issues which was of less concern...

متن کامل

Missing Value Imputation with Unsupervised Backpropagation

Many data mining and data analysis techniques operate on dense matrices or complete tables of data. Realworld data sets, however, often contain unknown values. Even many classification algorithms that are designed to operate with missing values still exhibit deteriorated accuracy. One approach to handling missing values is to fill in (impute) the missing values. In this paper, we present a tech...

متن کامل

Incomplete Pattern Classification using a Multi-Task Approach

Missing data present a challenge to many pattern classification tasks. One of the most recommended ways for dealing with unknown values is missing data imputation. This paper presents an useful neural network approach that combines the classification and the missing data imputation using Multi-Task Learning. An effective cost function is also proposed that tends to provide imputed values for th...

متن کامل

Missing data imputation in multivariable time series data

Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...

متن کامل

Missing Data Handling in Multi-Layer Perceptron

Multi layer perceptron with back propagation algorithm is popular and more used than other neural network types in various fields of investigation as a non-linear predictor. Though MLP can solve complex and non-linear problems, it cannot use missing data for training directly. We propose a training algorithm with incomplete pattern data using conventional MLP network. Focusing on the fact that ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Intelligent Automation & Soft Computing

دوره 15  شماره 

صفحات  -

تاریخ انتشار 2009